497 research outputs found

Using Experts for Predicting Continuous Outcomes

Author: J. Kivinen
M. Warmuth

The perceptron algorithm versus winnow: linear versus logarithmic mistake bounds when few input variables are relevant

Author: Kivinen J.
Warmuth M.K.
Auer P.
Publication venue: Published by Elsevier B.V.
Publication date: 01/01/1905

Abstract: We give an adversary strategy that forces the Perceptron algorithm to make Ω(kN) mistakes in learning monotone disjunctions over N variables with at most k literals. In contrast, Littlestone's algorithm Winnow makes at most O(k log N) mistakes for the same problem. Both algorithms use thresholded linear functions as their hypotheses. However, Winnow does multiplicative updates to its weight vector instead of the additive updates of the Perceptron algorithm. In general, we call an algorithm additive if its weight vector is always a sum of a fixed initial weight vector and some linear combination of already seen instances. Thus, the Perceptron algorithm is an example of an additive algorithm. We show that an adversary can force any additive algorithm to make (N + k − 1)/2 mistakes in learning a monotone disjunction of at most k literals. Simple experiments show that for k ≪ N, Winnow clearly outperforms the Perceptron algorithm also on nonadversarial random data.
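The contrast between additive and multiplicative updates described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' experimental setup: the Winnow1-style variant (promote by a factor `alpha`, eliminate on false positives), the values `alpha=2` and threshold `theta = n`, the bias term in the Perceptron, and the helper `random_disjunction_data` with its sampling probability `p` are all assumptions chosen for illustration.

```python
import math
import random


def perceptron_mistakes(examples, n):
    """Count mistakes of a Perceptron (additive updates, labels in {-1, +1})."""
    w = [0.0] * n
    bias = 0.0  # assumption: bias treated as a weight on a constant feature
    mistakes = 0
    for x, y in examples:
        score = sum(wi * xi for wi, xi in zip(w, x)) + bias
        prediction = 1 if score > 0 else -1
        if prediction != y:
            mistakes += 1
            # Additive update: add (or subtract) the instance itself.
            for i in range(n):
                w[i] += y * x[i]
            bias += y
    return mistakes


def winnow_mistakes(examples, n, alpha=2.0):
    """Count mistakes of a Winnow1-style learner (multiplicative updates)."""
    w = [1.0] * n
    theta = float(n)  # assumed threshold, not taken from the abstract
    mistakes = 0
    for x, y in examples:
        score = sum(wi * xi for wi, xi in zip(w, x))
        prediction = 1 if score > theta else -1
        if prediction != y:
            mistakes += 1
            for i in range(n):
                if x[i]:
                    # Promote active weights on a missed positive,
                    # eliminate them on a false positive.
                    w[i] = w[i] * alpha if y == 1 else 0.0
    return mistakes


def random_disjunction_data(n, k, m, p=0.05, seed=0):
    """Hypothetical helper: m random Boolean instances over n variables,
    labeled by the monotone disjunction x[0] or ... or x[k-1]."""
    rng = random.Random(seed)
    data = []
    for _ in range(m):
        x = [1 if rng.random() < p else 0 for _ in range(n)]
        y = 1 if any(x[:k]) else -1
        data.append((x, y))
    return data


if __name__ == "__main__":
    n, k = 200, 3
    data = random_disjunction_data(n, k, m=2000)
    print("Perceptron mistakes:", perceptron_mistakes(data, n))
    print("Winnow mistakes:    ", winnow_mistakes(data, n))
```

For this particular variant, a standard potential argument bounds the number of mistakes on any sequence consistent with a k-literal monotone disjunction by alpha * k * (log_alpha(theta) + 1) + n / theta, which is logarithmic in n for fixed k. The random data here only gives an informal comparison and says nothing about the adversarial lower bounds in the paper.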

Elsevier - Publisher Connector

Crossref

Galiciana

Hedging structured concepts

Author: Kivinen J.
Koolen W.M.
Warmuth M.K.
Publication venue: Omnipress
Publication date: 01/01/2010

International Migration, Integration and Social Cohesion online publications

Intervalley-Scattering Induced Electron-Phonon Energy Relaxation in Many-Valley Semiconductors at Low Temperatures

Author: A. Savin
A. B. Pippard
J. Ahopelto
M. Prunnila
P. Kivinen
P. Törmä
Publication venue: American Physical Society (APS)
Publication date: 01/01/2005

We report on the effect of elastic intervalley scattering on the energy transport between electrons and phonons in many-valley semiconductors. We derive a general expression for the electron-phonon energy flow rate at the limit where elastic intervalley scattering dominates over diffusion. Electron heating experiments on heavily doped n-type Si samples with electron concentration in the range $3.5$-$16.0 \times 10^{25}$ m$^{-3}$ are performed at sub-1 K temperatures. We find a good agreement between the theory and the experiment. Comment: v2: Notations changed: $\Delta_i$ --> $\delta v_i$, $\tau_{eff}$ removed. Eq. (1) changed, Eq. (2) added and complete derivation of Eq. (3) included. Some further discussion about single vs. many valley added [3rd paragraph after Eq. (7)]. End notes removed and new reference added [Kragler and Thomas]. Typos in references corrected.

arXiv.org e-Print Archive

Crossref

VTT Research System

Competing with stationary prediction strategies

Author: A. DeSantis
G. Gruenhage
G. Shafer
G.H. Hardy
J. Kivinen
J.F. Hannan
N. Cesa-Bianchi
N. Littlestone
P. Auer
P. Billingsley
V. Vovk
V.N. Vapnik
W. Rudin
Y. Kalnishkan
Publication date: 13/07/2006

In this paper we introduce the class of stationary prediction strategies and construct a prediction algorithm that asymptotically performs as well as the best continuous stationary strategy. We make mild compactness assumptions but no stochastic assumptions about the environment. In particular, no assumption of stationarity is made about the environment, and the stationarity of the considered strategies only means that they do not depend explicitly on time; we argue that it is natural to consider only stationary strategies even for highly non-stationary environments. Comment: 20 pages

arXiv.org e-Print Archive

Royal Holloway Research Online

Crossref

Royal Holloway - Pure

On-line PCA with Optimal Regrets

Author: A.T. Kalai
D.P. Helmbold
J. Kivinen
K. Tsuda
K.S. Azoury
M. Herbster
M.K. Warmuth
N. Cesa-Bianchi
Publication date: 01/01/2013

We carefully investigate the on-line version of PCA, where in each trial a learning algorithm plays a k-dimensional subspace and suffers the compression loss on the next instance when projected into the chosen subspace. In this setting, we analyze two popular on-line algorithms, Gradient Descent (GD) and Exponentiated Gradient (EG). We show that both algorithms are essentially optimal in the worst case. This comes as a surprise, since EG is known to perform sub-optimally when the instances are sparse. This different behavior of EG for PCA is mainly related to the non-negativity of the loss in this case, which makes the PCA setting qualitatively different from other settings studied in the literature. Furthermore, we show that when considering regret bounds as a function of a loss budget, EG remains optimal and strictly outperforms GD. Next, we study an extension of the PCA setting in which Nature is allowed to play dense instances, which are positive matrices with bounded largest eigenvalue. Again we can show that EG is optimal and strictly better than GD in this setting.

arXiv.org e-Print Archive

CiteSeerX

Crossref

Inhibition in multiclass classification

Author: Bottou L.
Chang Y.-W.
Charles Elkan
Dempster A. P.
José M. Amigó
Kivinen J.
LeCun Y.
Lugosi G.
Platt J. C.
Ramón Huerta
Rifkin R.
Shankar Vembu
Smith B. H.
Tewari A.
Thomas Nowotny
Tsochantaridis I.
Weston J.
Publication venue: MIT Press - Journals
Publication date: 01/09/2012

The role of inhibition is investigated in a multiclass support vector machine formalism inspired by the brain structure of insects. The so-called mushroom bodies have a set of output neurons, or classification functions, that compete with each other to encode a particular input. Strongly active output neurons depress or inhibit the remaining outputs without knowing which is correct or incorrect. Accordingly, we propose to use a classification function that embodies unselective inhibition and train it in the large margin classifier framework. Inhibition leads to more robust classifiers in the sense that they perform better on larger areas of appropriate hyperparameters when assessed with leave-one-out strategies. We also show that the classifier with inhibition is a tight bound to probabilistic exponential models and is Bayes consistent for 3-class problems. These properties make this approach useful for data sets with a limited number of labeled examples. For larger data sets, there is no significant comparative advantage over other multiclass SVM approaches.

Crossref

PubMed Central

Sussex Research Online